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60 TCTAGAATAT AGAAGATAAG TTTGCGTACA ATTCAGTCCT T T G AAG ACCT 
GATAAGCTTT ' AAGAAGGAAG ATGGGTTACA C AT T G G G AAA TGGTTGCAAT 
CTGCACATGG ■ CAGAGGCAAG AG AT GC AAAT CACATTTCTT ACATACTCCA 
10 TACAAATCTT AC .AAG AC T GT TTTTCTTTCT CAT T T AAAAT AAG AAG ACC T 
GCCAGTCTTC CCCTTATTAC TAATTACAGT CACTCTGTAT CTTTGTTGAC 
ATTGGATAGT TTTACATACT TCAACAGGCT GGTGTCATTA AAGTTGTGGT 
GGGTGGGCAC CAGAGACACG TGATTCAGAG TGGGAGGAGA TGCAGGAGAA 
ACGAGGCACA GCAGAAGCAG AAGCGAGGAA AAACACTCTC AACGTTACTA 
ACACATCGAG AGGTTCCGCA CACTAGCAAT ACGGGCTGAA TCTGACCTAA 
TCTCTGCTGT TGAAAATTTT GCCTAGCCGC ACACTAGCAA TACGGGCTGA 
ATCTGACCTA ATCTCTGCTG TTGAAAATTT TGCCTAGCCT G T C AC AC AAG 
TGCTGAGCAT ACAGAAAAAG G AG AGTXAT T CTCTGGTTCT TTGACTAACC 
AAATAGTCTA TAT C AAAT T G CCTAAGATAA TGTATACATT TAGTACATGA 
010 CTGGTTATAC CTATTCTATA TGACTATTAT TTAAATGTGA ATTTACAAGT 
GAGCATATGA AGTCCATTTT ACATGGCTAG TACATATAAC TTTTAAAAAG 
TTGGACATAG TTATATTTTT CCATTTATTT ATTTACTTTA TATCCTGATC 
ACAGACCCCC CCCTCCTCTG GATTAACTCT CTCCACTGCT TCTTACCCCT 
CCCCATCTCT CCTTCACCTC TGAGAAGGGG GGATACCTCC TGTCTTATCT 
GGTTTCAGTG GGAGAAGGAT GTATCCTAAC ACATATAATT TTTAATATCC 
TGAGTTTTTC TTTCATACAC CT TACT TAT T CTATTCATTT TTCAGGAAGG 
CATGTTTAAT GTTTTTTTTT TAATTTTATG TGTACGAGTG TTTTGCCTAC 
ACAGTCATAG TGCATCGCAT ACATTTTTGC TGCCCGTAGA GATCAGAAGG 
GAGCATTGGG TTCCCTAGGA CTGGAGGCAT GAACCACCTT GTGGGTGCAG 
AGAACTGAGC CTGGGTCATC TCAAAGCATC AGGTTCTTCT TGAGTCATCT 
CACTTGCCAC TTCTCCCATT TACT GAT TTT ATCTGTGTGC AGACATTCAT 
GGCCCAGTCC ACAGGTGGAA GTCAGGGACA ACCTATAGGA GTCAGTCCTC 
TCCTTCTACC GTGTGAGTCC CTGGCCTCAA ACTCAGGTTG TCGGGCTTCA 
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TAGCAAGAGC TTCTA7TTGT TGAGCCATCT TGCTAGCCCC ACCCCATACT 
AT C T T T AT AA TATCTGTTTA ATTAAGACAT TCATAATGAA TTTTATTAAC 
ATTCATCGTT ATCCCCTTTA CCAATTTTAC TATGTATTAA TTGCCACCCC 
TTTAAATTTA ATTACTTCCT TGGCTGGGTT TTACAGGAGA GTTCCAGGAA 
GCTAGATGGA GAGATGGCTC AAC AG T T TAG AGCAACGGCT GTTCTTGCAG 
AGGACCTAGG TTCAAGTCCT GGC AC TCAG A GGTGGCTCAC AATCATCTGT 
10 GACTTCAGTT CCAGGGGATC TGAAGAATTC TTCTGGGCTC CATGGGCATC 
AAC T AC AC AC TTGGTTCATA GACATACATG CCAGCAAATG ATTGATCCAT 
ACATATGAAA TAAACCATAA ACAGAAAAAA AAAAGGAAGG TGAGGGAAGG 
AAAAAAAGTT TAAAAAAAGG AAAGGAAGGA AGGAAGGGAN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN tJNNNNNNNNN NNNNTCTCTC CATACTGAAA 
GATGTCCACA ATGACTAAGG^GAATTTTTTT TAAAAGACAA GCACAACGTT 
TTCTAGGGAT CAAACTCTAT TTGTGAGGAA GACTGGTGGT ^TGAAGATTA 
CATAGCAGAG T T AC AT C T AA CATGAGCGTG TTTCCCCTGG ATGGAAGGAG 
TCTGATAACT TGTCTTTCTT TCTTAGTTAG CATCTCAGAG TCCCCCGCCT 
CCCTTAACAT CCTTTTTGCA CACCATCTTT T TAG G AAAAT GGATCATTTA 
TGGGGATGTA GTGATTTGTA CAAGAATGTC CCCTGTGGGC TCAGATATTT 
G AAT AC T TAG TTCCCAGTTG GGGGAGCTTT TGTAGGGAGG TTGGGAGGCA 
CAGCCTGGCA GGAGGAAGCA TGCTAGCAGC TTTGAGACTA TAAACCCTCA 
TCTACTACCT TGTTCTCTTT CTGCATTGTG CTGTGTCTGA CACTGTGAGA 
TTCCTGCTCC CGATGCCATG CCTGCCCGCC ATGATAGACT CCTAGCCCTC 
TGGAAAGGTA ACCTCAGTGA ACTCTCTTCT ATAAGTTTCT TTGCTCCTGG 

Hindi I I (-4200) 
TGTTTTATCA CTGAAACGGA AAAGCTT GCA GGGAGGTAGG" AGGCAGCCTG 
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BstZII (-4100) 

TGGCGTTGA7 TCAATGCACC TGGCCTTATC C T C G GAT G AG ATC GGTCACC 
AGTCAAAAAC TGTGAGCTTG AAGGTCTTGG GTGCTTAACA TCTATTTTTA 
CAAATCTTAT TTAGCAACTT AGAACTGTGA AATATTGGAA AGCTACTTAA 

-4 010 AC C T TCT AAA CTCCCTCCTC C AC AC TAT G A GAATGTTACA TTTTCTATTC 
AGTTATTTTT GAG C AG T AAA CAGATGAATC AAGGAATATG CCCATCACAT 
CAAGAGTGCT CCTAAATGGA CTTGCTTGTT ATTCATTTAC AGTGTGGCCC 
CTTGACTTTC ATCGGCACTC CTAGCAGAAA ACAAAATCCG CCAGATGGAG 
CTGGAGAGAT GGCTCAGCTG T T AAGAAT AC TTATCCCTAC ACAGGCCCTG 
GAGCCAGTTC CCAGCACCCA CACGGTGGCT CACAACCATC TGTAACTCCA 
GTTCTAGGAG ACCCGACTCC CTCTTCTGTC TGAAAACACC AGGCACGCGT 
GCGGTCTACA TACAAACATG AAAGCAAAAT ACACACATTA CATAAATAAA 
TCTTAAAAAA TGATTCGGGG TGGGGGAAGG AAAAAAAAGG AT GT^ AG AAA 
ATCGATGTAA CTGTTTTTTC CTTTTGCACA GATCTAAGTT AGGGAAGGAG 
AACATTCTCT TACCATCGAA AATAATTGTT TTCATTGCCC CCAAGTCTGC 
TAATAGAGCT TGCTACCTTC ATGGCTGTCG TAAGGATGAG GCAAAGATGG 
ACTTCAGCTT TCAGACTGTG TCTGCTCAAA TGTTGGCTAC TCCTGTTTTC 
TGACCCCCTT CTCTGGTGCA ATGTGGACTT TCAATTAATT TCCCTGCATC 
TTTTACATAT TTGATTTAAA AAATATTTTA TTTTATGTAA TTGTATGTAT 
ATGCATGTCA ATAAGCATAT GTGTGTGTGT TTCCATGGAA ACCAAGGCAA 
CAGATTTTCC AG AGCT G TAG AAATGGGCTG TGAGACGCCC ACTGTGGGTG 
TTCGGAACCA AACTCGGGTC CTGTGGAAAG ACAGCGAGCA CCCATAATGC 
AGAGGTATCT CTCAGATTTT ACTTTAAAAT TTCAATTTTC TTTTTTTTTT 
TTAAAGTTCC AAGTAACTAT AGGAAAGTAC ATGGGTATAT AGATCCCCAG 

-3010 TACCAAGATT CTTCCTTTGC AGGTAGCACA ACTTGGTTTG TTTCACATAA 
AG AAT G G AAA GTCATTAAAA CACTCATCAC ACTGTAAAGT AGAATTGAAC 
TCTGACAGAA CAAGCGAAGT GAGTCTGACT TCCAGGTAAC TGAGCCTTCT 
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TTTCCTCCTA AAGACACAAG CCATACACAG AGTAAAATAA ACTTGGGCAT 
GGT GAG AAGG AAACAACGCA GGAGC-GCTAG CCAAGTCTGA GAGTCGTGAG 
TGTGCTCGGT TTA7AAACGG AGCCCACCTT GCCAGCGAGG TAGTCACATG 
CTCTGCTAAA CAGAAAC TT A AGAAAACACT TACACGAAGC AAACATGGGG 
AAGTGCCATG CAAGCATGTG ACTGACTGGT GGCAATGACC GAAACCACAG 
CAGCCACTAG AAAAGGAAGG GTAGTGCGCC ACACTGTAGT TGTGAAAATG 
AACTTATTCA TTTATTTTGA AAAACGTGTA AGAAGCAAAG ATGTCTTCTT 
TCCCACCTAC CTTTGCGGCA GGCGAGCACT TCCTGGAATT TATAAAGTGC 
GATCTTTCTG GGGACTTCTC ATAACATTTC CTACTGCTCA TCTATGTCTG 
TGTCAAATAG AGAATGCTCT TGAACAAGTG TGTGTGTGTG TG.TGTGTGCG 
CGCGCACGCG CACTCACTCC TGCTCTGTTG AGGTCCAGTT TTGATGGTCC 
CGCCAGAGGT ATATTTGAGT ATCATTTCTC AAGAGCTTCA GCTGGGAGAC 

■Hi 

ACTGCCTCTT ACTGGCCTjGA AGGTCACTAG CTGATTCATC* TCCGTTTGGG 
CTGGCGCGCC TTGGGGATCC TCCTATCTCT CCTTCCCCAG TGCTGGGATA 
ACAAGGTTGG CACCACATGA GCCTTTTAAA ATGTGAGTTT GGAAGCT CAA 
ACGCAGGTTT TCATGCTTGC ACTGAAACTT CACAAGCTGA ACCGTCTCCC 
TCTCCTTCCC TCTCTTTTTT CCTTTTCTTC TTCCTTTTTA AAACACATCT 
10 TGTCTTTAAA AAAAAAAAAA GGCCCAAAAC AAGTGTAAAG TATTTCCCTA 
TGTGTGTGGA GGGAGGG AG T ATAGGAGGCT GATTTCACTG AGATCCTGTT 
AAATTTGGGT GCCATAGCCA ATCAAAGACG CATCGTTTCC TCTAAGAATT 
CTAAATGGGG CGATTACCAC GGGCCTGCAG GTTCTGGTTT GTATTAGAGG 
AG AC AC T GT C TTCTTAAGTA AAACATAGAA GGGGAAGTGT CCAGAATTGT 
AAATAAGGCT TC G AG AG AAG CCTTGTCTGG CCACCGGGAT GGAGAAGACC 
TACCTTCGCC TATCCAGGAT CCATCGTCCC TCCCTCTACC CAGATCTGAC 
AGCCCTCCTT GGCTCTTTTG CTGAGGTTTG TTTGAGTTTG TTTTACTCTC 
TGCAAGAGAA GTTTCCTTAA ACATTC7ACC CTGTTCACAA GTAAATACAC 
CTCTTAGCTA AGAC-GCCACA CACCCAGGGG GAACACCGAT AAAAAGAACA 
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AGCCAGAACC TTCAGAACGC TGTCGATAGG TACACCAAGC AGCCTTCA7A 
CGGAGTTTTC ATTCGTGAGG A GC T G AAT AT ACAACAAAGC TAAATGTGAG 
CAGACCAGGC A.TGCCTCTGC TAAATGAGGA TGCCCACACC AAACATGCCC 
AAGATCTTCA AGTATAATTT TATTATATAG ATTCGCTATG TGTTGACAT'G 
TTTTTATAGT GAACCTGGAT TTTACAAACC CTCCTGGTTT GCCACCTGCT 
TCTGGCACCA TACTTGAGGC TTAGGCACGT GATAAAGGAG CATGCCTGTT 
TCCCCCCTTA TTTTTTTTAA AG AAAAG C AC CATGTTACAT CATTAATCAT 
GCATATCAGT GTAGTTTAGA TCCGATGTAG AGACAATAAT CTTATCTCTT 
TGTCTGGCTG AAAGACTGTC CTTTAAACTA TCATTCTAAA TGCATTTGGT 
TTTTGCCAGG AGTAAAACAT GTCACAAGAT ATTTGTTGTC ATTTCCCAGG 

-1010 CGTGGAAGGA AAGGAATGGA AAGAAAACCA GGGGTGAAGG CTGCTGTTCC 
TCTCTAGTCG CTACTTGAAG T C TAG A TAG C TGGGGGGGGG GGGGGGACTG 
TTCACATGGG ACCGGTTTCC TCTTTG.TTCC TACACTGGCG CCTCTGGCAA 
AAAACTCTCC CTTCTCTTCC CCCCAAGCAT ATCTTGGCTG AAAGGTCAGC 
TCTGAAAAGG GGCCTGGCCA AAGTTACTGT AGGGGACCGT GGTCATGGAA 
CTGGGTAAAC AAAAGCACTC TAGCAGCCAC TGGAAAAGGA CCGGGGGCTC 
TTCTCTGTGC ATTTGCCCTG GAACCCTGAC CACCGCCAGC TCCCTGCATC 
TCCTTGCTAT GGGTTTTCTG GACCGACCCA GCCAGGAAGT TCACAACCGA 
AATGTCTTCT AGGGCTAATC AGGTAACTTC GGACGATTTA AAGTTGCCAG 
AT GG AC GAG A AAACAGTAGA GGCGTTGGCA ACCTGGATAA GCGCCTATCT 

-510 TCTAATTAAA AC AT T C AG AC GGGGCGGGGG ATGCGGTGGC CAAAGCACCA 
TAAAACAAAA CTTCCAAGTA CTGACCAACT CACTGCAAGT TTGTGCCCCG 
AGTACATCTA GGTTCAGGGG TTCTTGTCTT CATGCTCCCA ACTGCGGGCG 
GATTTTTGGT CCCTTGGGAC TTTCAGTGCA GCGGCGAAGA GAGTTCTGCA 
CTTGCAGGCT CCTAATGAGG GCGCAGTGGG CCTCGTGTTT CTGGTGATGC 
TTCCCAGGTT GCTGGGGGCA GCAAGTGTCT C AG AG C C C AT TACTGGCTAC 
ATTTTACTTC CACCAGAAAC CGAGCTGCGT CCAGATTTGC TCTCAGATGC 
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GACTTGCCGC CCGGC AC AG T TCCGGGGTAG TGGGGGAGTG GGCGTGGGAA 
ACCGGGAAAC CCAAACCTGG TATCCAGTGG GGGGCGTGGC CGGACGCAGG 
GAGTCCCCAC CCCTCCCGGT AATGACCCCG CCCCCATTCG CTAGTGTGTA 
+1 (transcription start) 

-10 GCCGGCGCTC TCTTTCTGCC CTGAGTCCTC AGGACCCCAA GAGAGTAAGC 

TGTGTTTCCT TAGATCGCGC GGACCGCTAC CCGGCAGGAC TGAAAGCCCA 

GACTGTGTCC CGCAGCCGGG ATAACCTGGC TGACCCGATT CCGCGGACAC 

CGCTGCAGCC GCGGCTGGAG CCAGGGCGCC GGTGCCCCGC GCTCTCCCCG 

GTCTTGCGCT GCGGGGGCGC ATACCGCCTC TGTGACTTCT TTGCGGGCCA 

VRE • f 

GGGACGGAGA AGGAGTCTGT GCCTGAGA^ C TGGGCTCTGT GC CCAGCGCG 

AGGTGCAGG A TGG AGAGCAA GGCGCTGCTA GCTGTCGCTC TGTGGTTCT^ 

CGTGGAGACC CGAGCCGCCT CTGTGGGTAA GAAGCCCACT CTTTAGTAGT 

AAGGCGGAGA AGTAGGGTGC GGGCGGAGAG TGGGAATAGA AGAGGACCTA 

ACTCGTAGAG CTCTAGAGAC CCTCCTCCCT TGGGTGTTCT TTCACTTACC 

+ 4 90 AATGGGGAAA CTGAGGTTCA AAGACTCTTC CGAAATGACT CAGCCAGGAT 

TCTACTCTCC CCCGGGCATC GGTTGGAGCG TGTCCTGCGG AGCCGTCACA 

GCCCCTGGCG CTAGGTAGGC AGGAGTGGAA AGGCGGCCTG AGCCGGGGCA 

GGAGATGCTC CCACTGGCAG GAACAGGCGG TCAAACGCTG GGAAGCCAGC 

TCAAGCCAAG CGGCCCGGCT GGCATCAATC ACTCGGTGCT GTTGCCCACC 

GCCCTAGTGG GGGGCAGGGA ATCCGCCTCT GGCTCCGCTC CCCTTTAGCT 

CCAGCGTGTA AGCGCACGGA CTATGTGAGG GTAGGTCTCT T CAT AG AG C A 

ACACTTTCCT CCCTCAACTT TCTTTGATGC AGAATGCTAT TTTTGCTGGT 

AGGAGGAAGA CGCGGCTTTC TCTTCTGTGA CAGCTTCTCC AGGTGTATTA 

AACTAAATAA CTCTCCACTT ACCGACTCCA AAGCGCTGGT CCTGGGGTAA 

+ 990 ACTCTGAAAG TCTCAGAAAC TCTTGAGCTT GGCACCTAGT TATAGGTCAC 

TTTTCTTGTT TTAAAATGCC CTCTGCTTCA AGGTTAGGCC CACACTCGCT 
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CTTGGGCTTT TG7GCAATAA TTTCCC7TCC CTTCCCTTCC CTTCCCTTCC 
CTTCCCTTCC CTTCCCTTCC CTTCCCTTCC CTTCCCTTTC CCTCTTCCTT 
TTCCTCCTCC TCTTCCTCCT CTATTTCTCT GTCATTTCCT TTTTGAAGCC 
ACAGTTTGCA GAT7TCCAAT CTCCACCCAT TGGAGAATGG AGAATCAGGA 
AAAAAGAAGT CAATTCTGCA GAAACATTCC TTGCGCCCTA AGAGAATCGC 
ATGGCTTAAA AGCATTGGCA CTGACATACG GCGCCAAGAT CGCCTGTCTA 
GAGCTATTGA GTTTTCCTCA TAATGACTTG GTTCATCAGG CTAGCTCCAC 
CACGAGTGCC CTCTTGTTCC TGAGAAGGCC GCACTCTCCC CCTTTCTGGG 
AAGAGAAAGA CAGCCTGGAA CATGTGCTTG CCCTGGGTTC CATAGAGAAG 
CAAGTTGCTT TAAAGCCCAG AGAATTCCTA GTGTAGCAGC TTAACAGCGT ■ 
CCCGTTCTCT GAATAAGATG GAGGTTGCCC TTTTGGAGTG TGTGACTTGC 

; Xhol (+1600) . * 

f 

* TTAATTGGAT TGGGCTATAA TTGGTGCCAT CCAAGT CTCG AG ACAGAGCC 
GCTGTTGTTT TTCCTTCTGG TCTTTGAGCG GGAAGGATAA CAGTGCACAA 
ATTAATTAAT GTTGGTTATC GGATTTGAAC ATAAAAGGGC TTTTATTGTA 
TAGTAGCATA TGTACCTCTT GCAGTCAGAA TGAGCTGTCT AAAGAACAGA 
ACCCAAACTT GCCGATGAAA ATGAATGAGG TTTAATAAAG GCGATGGATG 
AGCATTAGTC ACTGATGTAA ATCTCCAGTT ATTGATAACC TCATTGACTG 
GATTTGATTG CAGACATGTA TTGGTATGGG GCATCCTTTA AAGATGAGCA 
+ 1990 TAGCCAACGT GCCTGCACTC TAAGAGAATC TATGGCTGTA TGTTATTACA 
GAGACAGTTG AGAAGCTCTT AGTGGCTCTG GCGTGTAGAT CAGCGGTAGA 
GCGCTGAGGC TCTGCGCTCG CTTCCTGGCA CTGAAGAATA AAGGCCATTT 
ACTGTGGTGG TGCAGTGGGC GCAGTTTGTG ACGAGTTACT ACTACATTTT 
CCTCACACAT CTGCC TGACT AA TGAGTTCA TCAGATGAGC GTATCCAGTG 
ATTGTTTGCA GGTTAATGGT TCTCAGTCAT GTTTAGAATC TACTTATCAA 
ACAAATTGTT TTCTCATTTC CTGCTTCTTC TCAAACAAAG TAAGATTCCA 
TTATTGAAAG GCTTGTTTAA GAGCATTTTA ACTGCTTGCC TATGTTAGGG 
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AC AG T G AC T T AT 7 7 CAT AT T GA.CAAATATT ATGCCGATTA ATTGAATATG 
ACTACCCAGT TC7ATAGCTG TCTCAGGGCA GACCAAGAGC ATCTGTGATC 
CAGTCACTTT AAA7GCCATT TAAAATGCAT AATTTGTTGG TCTAGGAATA 
AACACACTGT AAAG T T TAG A ATCACGGCCC AAACACAAGT CTTTAACAAT 
GCCAACTAGC TTCTGAGATT CATTAATGTC ATTTAATTAC CAATGTTTTA 
AAAATATGTC ATTAATTACT AAATCTATAG TTGTAACAGC AACACATGTA 
CATCTTATTA AGTTGGGTAT ATTCAGGGTG GCATAGCTGT AGACTATTGC 
ACATCTGTGT TGGTGAGCCA GTGGAGAACT GCCTCCTGGC TGTTCTCAGA 
AGGCCACAGT GTCACGGCAT TGGCTATTTG CCTTGGCTCT TTGCTAATAC 
TTTATTGACA TGGCCTCATC TTCGTTCACG TTCACTTATT TGCCCAACAA 
CGTCAATGCC AGCTGAGGCC TTAGGAGTCA TCTGTTCTTA GTCAGTGCGA 
AT TAG AAAG C CTGGATGCCT GCCTGCTATT AATTAGTTAT TCTTCTCTTC 

f 

+2990 TGAGACAGAG TCTCACTGTG TGGCCOAGGC TAGTCTCAAA CTTGCGGTCC 
ATTTGTCTCA CT CAT C AG AA TGCTGGGCTT CCAGGTGTGT GCACCACACT 
AGGTAGCTCG CGTTTTAAGC TAAGAGCTGG AAGATCCTGA TGTCCTTTAC 
CATGGTGGGC ATGTTACAGG TTAGTTGACT GAAAAC T AG T TATCTCGCTG 
TGTAATGACC TGCAGTGGTA TGTATCTCTC AAGATGCTTT TTTGCATTTC 
AATCAGTTAG GTAACAAGTT CTTAAGTCTC CAGCTTGGTA TTGGCATGAG 
CTCAGAGCTT TGATTAATGA GTTGGGACCC CCTAGCTATT GCTCATTAGA 
CTTACACTAT TTTTAGTTTT GCTCTGAGTT TATGAATATG CATGTATGCA 
TGAACTTGGG AGATATTTTT CTTCCCCAAT TCCTTTTCCT C CAT T T AAAT 
GTGCTGTCTT TAGAAGCCAC TGCCTCAGCT TCTGCAGCTC AG AT AC C AAA 
GGAAGTCTGG TACACAGCA T GATAAA AGAC AATGGGACGG GGTCACAGTG 
GCTCCCGTCC CTTTCAGGGG TATGGAGACG AGC TGT AG AG AGATGTCTCC 
AGGGAGTTTT CAT7AATCAG CAATTTAGTC AGATCTGTGC ATCCTATGCT 
T T AC AAG AAA TGTCAGTGGG CCTGAGATCA TCAGATGGAG GTTCATCGGG 
TTTCAATGTC CCG 7 ATCCTT TTG.TAAGACC TTGAAGTTGG CAACGCAGGA 
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AAAC AG G AAC TCCACCCTGG T G C C G T G AAT TGCAGAGCTG TTGTGTTGGT 
TTGTGACCAT CTGCCCAT7C TTCCTGTTAT GACAGAGCTT GTGAACTTTA 
AC T G GG ACT G GGGCAAAGTC AATCCCACCT TTATACAATG AATTGCTGAA 
GAGGCCTTTT AAAACTTGGA GTGTGCATTG TTTATGGAAG GGCTTTCCTA 
BamHI (+3900) 

TTGGATCCAA CTCTTTTCTA ATTTGTTTCT AGGTTTGCCT GGCGATTTTC 
+ 3990 TCCATCCCCC CAAGCTCAGC ACACAGAAAG ACATACTGAC AATTTTGGCA 
AAXACAACCC T TCAGATTAC TTGCAGGTAA GGATTCCTTT TTGAGCCAGC 
TTTCCTATGT GAAAGGACTC ATTGTTTACT GAGGTCACAA CAATTTCCAC 
TATTGCAGAA GTATAATAGT ATTGTTACAA TTGTTTATAA ATCATGAGAC 
TTCTAAGAAC CTATTTAATA ATGAAACAAT GGAAAAAGTC TTTTCAAACC 
TTTGTACTCT TTTGCTGAGC CGTTTTCAAC ATGCACAAAC AT AT T ACAcX 
AATATAACAT ACACAGGAAC ACACATGAAT GCATGGGATG ATGTGCCf AA 
AACTAGCATG TAATTGATAT TC AC AAT TAT TGATAAATTA GTAAAGCAAA 
GGAATTCCTT ATGAATAGAG CTAAAATTCT ATCCATGTTC AAGTCACCCA 
GAATGGCTTC TGGACATTTT TTTTTTTAGC TGTTTTCTAC AAGTGAAATT 
CTGCCTGTAT TAGCAATTTA ATATCTAGCC AAT AAT AT TC CTGACCATAT 
GTCCTGTTCA GACCATG AC C TTCATAATCT GGCTTGATGT TCTGGGCTTC 
TTTCCCTCTT GCCAGCAAGA TGTCACGGTG TTGATGCTGG ATAAACTGAG 
AAACAGAAGT TTTTCGCAAG AAGAGGACCT TGAATTTTGC TTTTCCCCTG 
AGAGACAAGA AAGGAAACTT AGAGGAGGTG TAGCTGGGAG TGTGGTCATT 
CAT G AAAG AC CTGTTTGCAG GGCAGTGTGT TTTGCTGGGG ACAGTAATGA 
GCCTAGATCG TAGTGCCATC CCAAGAGAGT GCTTGGTGGC AAAAAGAGCC 
CTAGCAGCTT GTGGCAGTTG CCTCATATTT GAAGAATACT AAGAGGTCCC 
C C G AAT AAC T CAGGGCTAGT GTTGATCATT GCATGTGGAG AGAATCCAAG 
CCTCCTATCT AGGGTCTACA AAAGTAACCA ATGCCCAGTC TTTGGGGGAA 

Figure 1 continued 



10/21 

+ 4 990 AGCAAAACCA GAAAGCGATG AT AG C AG G AC CTGTTTATTT TCATTAAGTC 
ATGGCAT7TC C AG AG AC T T T GCTCCCCCTA T T C T C AG AC A CAAAGCCCAC 
TTAAGATCTC CCTCTGGAGA CTGCTGGGAA CATTTCT TAA GT 7 C T GAAAA 
AACCCTGGAG TGATTGGGCA CAGACGATCC TGTCACTTCA TGTGAGTGCT 
AAGCTCTTTG GGTGATGACT CAGTGGGTCA CATTGTTTTA TTCATATTGA 
CTACCTTCCG TTTGCTTTGC GGAGAATGGA AGCTATAGAA GTCTGTTTGG 
TGTGGCCCTC ACAAGGCACT GTGAGCTTCT TCTCTCTGTG TGCTAACTTC 
TTACTCTCCC TTGCTTATAC CCACATAGGG ACTCTGGCTT TGTTGCTGTT 
CTTCAATGCT TCAGATGTGC CCTGGGTCCT GTCTGTCCTT CACACTTACT 
GATGCTGCCT GGAATGCTAT TCCTCCCAAT GTGCATAGGG CCAGCTCGGT ' 
CCAAATCCTC TCTTTTCTTT GCCTCTT^TA TATTTTCCTT C AC AG TAT C A 
AATCACCACA GTTTATGCAA CAAACTQAAA CTTTAAAATT GTCTGTCTGC 
TTATATTAGT GATAGGTTCC AGAAAGGCAC TGATTTTTTT TCTTCCCT GG 
TGTACACTGG GCAACTACTC TACCACTGAG CGTGATATCC TTGGTCCCTT 
AAAAGTTATC GTCTGTCCTT AATAATGCTT AGCAATCATA TTTGCTTAAA 
ATATTTATTG AATGACTGCA GGAATGAATG AATGAATGAG C T AACAG AAA 
ACTCATGACC ATGTGGGTGA TTTCCGAAAC AGAGTGTGAG ATCTTTGGTG 
GCATGTCCTT GTAGACTGTC TGCCACCAGT ATCTATCATC TTGAAGGTGA 
CTATTGAGTA GTTTATATGC ATGTGAAAAA CCAAACCTTC TATTCTCTTA 
CTCATAGCCT CTCTTAATCA TAGCCCTGTG GCATGGAGTG TACCATTGAT 

+ 5 990 ATCTTCCTGG AATACTTTTT CAGGGGACAG CGGGACCTGG ACTGGCTTTG 
GCCCAATGCT CAGCGTGATT CTGAGGAAAG GGTATTGGTG ACTGAATGCG 
GCGGTGGTGA CAGTATCTTC TGCAAAACAC TCACCATTCC CAGGGTGGTT 
GGAAATGATA CTGGAGCCTA CAAGTGCTCG TACCGGGACG TCGAC (SEQ ID NO: 1) 
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GATA PEA3 

AAATGTGCTGTCTTTAGAA^ 65 

GATA API 
PCAC AGCATGATAAAAGACAATGGG ACX3GGGTCACAGTGGCTC 130 

NFkB API 
GACX^AGCTGTAGAGAGATCTCTCCA 195 

STAT SCL/TAL-1 
TOCT ATGCT TTAC7VAGAAATGTCAGTGGGCCTGAG ATCATCAGATGGAGGT TCATCX3GGTTTXIAA 260 

Ets-1 GATA EtS-1 
TGTCCCGTATCCTTTTGTAAGACCTTG^ 325 

"SCL/TAL-1 EtS-1 

GCCGTGAATTGCAGAGC^ 390 
GCTTGTGAACTTTAACTGGGACTC 455 
AGGCCTTTTAAAACTTGGAGTGTGCA^ 511 
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